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ABSTRACT 

A  model  is  presented  for  the  prediction  of  future  organ- 
ization size  based  on  the  numbers  of  recruits  entering  the 
organization  in  the  past.   This  model  utilizes  the  correla- 
tion between  populations  of  successive  time  periods  in 
order  to  better  estimate  the  future  remaining  personnel  in 
the  system.   The  number  of  personnel  leaving  from  each 
recruit  cohort  is  assumed  to  follow  the  same  probability 
distribution,  which  is  a  function  of  the  age  of  the  cohort 
in  the  organization  and  the  grade  in  which  the  cohort 
started.   For  large  cohort  sizes  the  total  personnel  in  the 
system  is  approximately  normally  distributed.   This  result 
justifies  the  use  of  a  best  linear  prediction  method  which 
takes  into  account  past  errors  of  estimating  the  continuing 
population  from  one  period  to  the  next.   Sensitivity  of 
predictions  to  errors  in  probability  estimates  is  discussed 
The  model  is  applied  to  predicting  university  student 
enrollment.   Comparison  of  predicted  and  actual  student 


LIBRARY 

NAVAL  POSTGRADUATE  SCHOOE 

MONTEREY,    CALIF.    93940 

TABLE    OF    CONTENTS 

I.       INTRODUCTION    7 

II.      MODEL 11 

III.   MEETING  SPECIFIED  GOALS  22 

IV.   SENSITIVITY  TO  ERRORS  IN  PROBABILITY 

ESTIMATES 30 

V.   APPLICATION  TO  UNIVERSITY  ENROLLMENT  34 

APPENDIX  A:   REMAINDER  TERMS  TO  PREDICTION 

ERROR 40 

BIBLIOGRAPHY ■ 42 

INITIAL  DISTRIBUTION  LIST  43 

FORM  DD  1473 45 


LIST  OF  TABLES 

I.   New  Enrollments  at  the  University  of 

California,  Berkeley  35 

II.   Fractions  of  Students  Attending  Each 

Successive  Fall  After  Enrollment  36 

III.   Values  Used  in  Prediction 37 

IV.   Predicted  Total  Fall  Enrollment  39 


I.   INTRODUCTION 

In  many  large  corporations  and  institutions  with  a  high 
rate  of  personnel  turnover,  a  crucial  problem  in  recruitment 
planning  is  that  of  predicting  from  one  period  to  the  next, 
how  many  personnel  presently  in  the  organization  will 
remain.   When  the  length  of  service  of  a  single  member  is 
fixed,  or  completely  controlled  by  the  organization,  the 
problem  is  trivial.   However,  when  the  length  of  service 
is  variable,  such  as  with  middle  management  in  large 
corporations  and  the  military  service,  or  university  student 
bodies,  probabilistic  arguments  must  be  used  to  estimate 
expected  attrition. 

The  theory  of  Markov  chains  has  been  widely  used  in 
prediction  models.   A  basic  assumption  in  such  models  is 
one  of  stationarity  of  rates  of  movement  within  a  system 
of  defined  states.   In  order  to  assess  the  transition 
probabilities  between  various  states  in  the  system,  it  is 
necessary  to  identify  various  characteristics  of  personnel 
in  the  organization  in  the  past  to  make  predictions  in  the 
future.   A  number  of  such  models  can  be  found  in  the  liter-* 
ature,  including  Bartholomew  [1967] ,  and  Thonstad  [1968] . 

Of  particular  interest  is  a  paper  by  McAfee  [1970] , 
which  describes  a  different  type  of  prediction  model,  the 
so-called  cohort  model.   For  this  model  McAfee  tests  the 
stationarity  properties  of  the  distribution  of  the  remaining 


fraction  of  an  initial  cohort  size  in  subsequent  periods 
after  entry  into  the  organization.   This  was  done  for 
three  different  cohorts  which  entered  the  system  in  three 
adjacent  periods.   This  property  will  be  assumed  to  hold 
in  the  prediction  model  of  this  paper.   When  the  cohort 
sizes  vary  from  period  to  period,  McAfee  showed  that  the 
Markov  Models  alluded  to  above  do  not  accurately  describe 
movement  of  personnel  in  the  system.   Since  later  in  this 
paper  we  consider  new  cohort  sizes  as  control  variables, 
it  is  not  meaningful  to  consider  them  constant  in  size  over 
time,  and  hence  we  concentrate  in  this  paper  on  the  cohort 
model  and  analyze  some  of  its  characteristics. 

A  basic  assumption  of  this  cohort  model  is  that  all 
members  of  a  given  cohort  behave  independently  of  each 
other,  and  each  member's  lifetime  in  the  system  is  a  real- 
ization from  a  common  stochastic  process.   These  assumptions 
lead  to  the  binomial  distribution  for  predicting  the  con- 
tinuing fraction  of  a  cohort  with  a  given  age  in  the  system. 

The  cohort  prediction  model  views  the  number  of  per- 
sonnel  in  the  system  as  a  superposition  of  continuing 
portions  of  past  cohorts.   We  assume  that  the  behavior  of 
a  member  in  one  cohort  is  independent  of,  but  from  the  same 
distribution  as,  that  of  a  member  in  a  different  cohort. 
(By  the  word  different,  is  meant  different  time  of  entry 
into  the  system. )   To  estimate  the  continuing  portion  of 
the  present  organization  size  for  the  next  period,  we  sum 
the  continuing  portions  of  the  past  cohorts,  basing  the 
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expected  number  from  each  cohort  on  the  size  at  entry  and 
age  in  the  organization.   (viz.,  the  sum  of  expected  values 
of  independent  random  variables.) 

As  time  elapses  in  the  system  for  a  given  cohort,  the 
number  remaining  in  the  organization  from  one  period  to 
the  next  is  dependent  on  the  number  remaining  from  the 
previous  period.   There  exists  a  correlation,  which  is 
positive,  between  the  remaining  portion  of  a  single  cohort 
in  one  period  with  the  remaining  portion  of  that  same 
cohort  in  the  previous  period.   This  correlation  is  cumu- 
lative when  we  examine  the  correlation  between  the  sum  of 
continuing  portions  in  one  period  with  the  sum  of  the 
continuing  portions  of  the  past  period.   It  is  this  corre-r 
lation  property  of  the  model  which  will  be  of  special 
significance  in  improving  the  prediction  characteristics 
of  the  model. 

Another  property  of  the  model  which  will  be  proved  and 
used  to  advantage  is  that  with  large  cohort  sizes,  the 
distribution  of  the  sum  of  continuing  cohort  portions 
asymptotically  approaches  a  normal  distribution.   This 
allows  us  to  derive  simple  tractable  formulae  for  predicting 
the  number  in  the  system  in  a  given  time  period,  given  the 
number  present  in  the  previous  time  period,  with  no  detailed 
knowledge  of  from  which  cohort  the  various  members  came. 
Without  this  property,  the  exact  expression  for  the  expected 
value  of  the  organization  size  next  period,  given  the 


realization  of  the  present  size,  is  intractible.   A  best 
linear  predictor  for  the  expected  value  of  the  organization 
size  next  period  is  this  conditional  expectation  when  the 
organization  size  is  a  normal  random  variable. 

Mathematical  expressions  for  the  prediction  of  future 
organization  size,  knowing  the  size  of  each  past  and  present 
cohort  are  derived.   A  decomposition  of  the  organization 
into  grades  is  then  made  to  predict  future  grade  sizes 
within  the  organization.   Finally,  an  application  of  the 
model  is  made  to  the  university  student  enrollment  problem, 
where  the  desired  prediction  is  that  of  total  enrollment 
using  data  on  past  periods  for  new  enrollments.   This  appli- 
cation is  made  with  data  from  the  University  of  California, 
Berkeley,  during  the  period  1961  to  1969  •   For  this  model 
there  are  16  lifetime,  distributions  which  repeat  yearly; 
one  each  for  freshman,  sophomore,  junior  and  senior  new 
students  admitted  into  each  of  four  academic  quarters. 
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II.   MODEL 

Denote  by  X.(u)  the  number  of  persons  who  enter  an 
organization  at  time  u  in  state  i.   Let  K  be  the  number  of 
grades  in  the  organization  and  M  be  the  number  of  time 
periods  (epochs)  beyond  which  no  member  can  remain  in  the 
organization.   Let  the  probability  of  a  single  member 
remaining  until  at  least  s  epochs  have  elapsed  (since  entry 
into  the  system),  be  p.(s),  and  let  X.(u,s)  be  the  number 
who  entered  in  grade  i  at  time  u  who  are  still  in  the  organ- 
ization at  time  u+s .   Then  X.(t-s,s)  is  the  number  in  the 
system  at  time  t  of  those  that  entered  at  time  t-s  in  grade 
i,  a  binomially  distributed  random  variable  with  parameters 
X.(t-s)  and  p.(s).   The  expected  value  of  X.(t-s,s)  is 
p . (s ) «X. (t-s )  and  the  variance  is  X. ( t-s )p . (s )q . (s  )  ,  where 
q.(s)  is  l-p.(s).   When  the  time  elapsed  since  cohort  entry, 
s,  reaches  the  value  M,  the  expected  value  and  the  variance 
of  X.(t-MaM)  are  both  zero.   It  is  assumed  that  the  behavior 
of  members  of  a  cohort  which  entered  the  system  at  time  t, 
is  independent  of  those  in  a  cohort  which  entered  at  time 
u5  u  /t.   It  is  also  assumed  that  for  1 ^  j,  the  behavior  of 
the  members  of  X.(t)  is  unaffected  by  that  of  members  of 
X  (t). 

Denote  by  Y.(t)  the  number  of  persons  present  at  time  t 
who  started  in  grade  i,  and  by  Y(t)  the  entire  population 
present  in  the  organization  at  timet.   The  entire 
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organization  size,  Y(t),  at  time  t,  is  the  sum  of  all 
remaining  portions  of  cohort  sizes  which  started  in  each 
of  the  K  grades,  counting  back  M  epochs  from  the  present 
time.   Thus  we  have  that 


K  KM 

Y(t)  =   E   Y. (t)  =   Z    EX. (t-s,s)  . 
i=l   X      1=1  s=0   X 


The  expected  value  and  variance  of  Y(t)  are  then  respect- 
ively, 

K   M 
E(Y(t))  =  l        l      X.(t-s)  p.(s) 
i=l  s=0   1       x 

and 

K    M 
Var(Y(t))  =   Z    E   X, (t-s)  p. (s)  q  (s)  , 
i=l  s  =  0   x       x      x 

where   p.(0)  =  1  and  q_AQ)    =  0. 

Our  objective  is  to  find  expressions  for  the  conditional 
expectations,  E [Y ( t+1) | Y( t ) ]  and  E  [Y. ( t  +  1) | Y. ( t ) ] ,  for 
which  we  need  the  distributions  of  Y(t)  and  Y.(t).   Although 
the  first  and  second  moments  of  Y.(t)  and  Y(t)  have  simple 
forms,  explicit  formulae  for  their  distribution  functions 
are  very  unwieldy.   However,  if  all  initial  cohort  sizes 
are  large  for  all  time  t,  then  the  distributions  for  Y.(t) 
and  Y(t)  are  asymptotically  normal.   This  follows  from 
Theorem  One  and  the  fact  that  the  Y's  are  sums  of  indepen- 
dent random  variables. 


12 


Theorem  One : 

If  {X.},  iel,  is  a  family  of  random  variables,  each 

independent  and  each  distributed  binomially  with  parameters 

p.  and  N.  (for  0<p.<l  and  N.>0),  where  I      p.<°°,  then  for 
11         11  l£l   1 

W  =   Z  (X.-N.p.)/(  2  N.p.q.)^,  the  distribution  of  W 
iel   1   !  x    iel  1  1  1 

asymptotically  approaches  that  of  a  random  variable  which 
is  distributed  as  normal  (0,1)  as  N.-^viel. 

Proof:   In  order  to  prove  the  theorem,  it  is  sufficient  to 

show  that  the  moment  generating  function,  G  (m) , 

^m2  W 

approaches  the  limit  e^        as  N.->°°\/ieI. 

1.   Let  B  =  (  I    N.p.q.  )% 
iel 

00      i 

x        x 
a.   The  expansion  of  e   is   £  -rj-   and  for  x  small, 

j=0  J  • 

x2       2 
is  1+x  +  -p-  +  o(x  )  . 

•  b.   For  x  small,  the  term  (1+x)   can  be  represented 


.    nx 
by  e 


m  1    (Xi-N.p1)/B        m(X.-Nipi)/B 

G  (m)  =  E(emW)  =  eL  l£l  J=  B  E  e 

1  iel 

/u  N.   -N.p.m/B 

_  .        m/B  ,   v  i     1*1 
n  (p.e    +  q.  )    e 

iel   *       ± 

q^m/B      -p1m/B|  i 
n  I p.e      +  q.e 
ieiv  x  x 

q1m/B 
3.   From  la,  the  expansion  of  p.e      is  p. [1  +  q.m/B 

p        2  -nip • /B 

+  ^(q.m/B)   +  o(l/B  )]  and  the  expansion  of  q.e    J- 

is  q. [1  -  p. m/B  +  ^(p.m/B)2  +  o(l/B2)].   Thus, 
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q  .  m/B     -p . m/B 
Pie      +  qie        =  (p1+q1)  +  (p1qi-p1q1)m/B 

+  1-2piqi(q1+pi)(m/B)2  +  o(l/B2) 

=  1  +  35p.q.(m/B.)2j  +  o(l/B2)  . 

o 
4.   From  lb,  and  using  the  fact  that  the  term  ^p.q.(m/B  ) 

is  small  for  large  values  of  B,  and  neglecting 

2 
o(l/B  ) ,  we  have 

qim/B  -pim/B|Ni  p. qiNi (m/B) 2/2 

p.e  +q.e  I        ~   e  ,    and  hence 

i           „    /     /r,\2  I    ^p.q.N.(m/B)                  n 

Sgp.q.N.  (m/B)  .    T   *iMi    l           y             ,2 

„    ,    v           n        *iMi    l  lei                                          Jgm 

G    (m)    ~IIe  =    e                                          =    e 

iel 

in  the  limit  as  N.  tends  to  infinity  for  all  i  in  I, 

since  (1/B)2(  Z  p.q.N.)  =  1.  Q 
iel  iii 


It  is  now  possible  to  examine  how  best  to  estimate 

expected  future  values  of  Y  and  Y.  when  past  realizations  of  Y 

and  Y.  are  known.  When  the  conditional  expectation  of  a  random 
i 

variable  cannot  be  found  explicitly,  Parzen  (I960)  suggests  a 
best  linear  predictor  which  minimizes  the  mean  square  error  of 
prediction  using  a  linear  function  of  previous  realizations. 
When  those  random  variables  are  normal,  this  function 
gives  the  exact  conditional  expectation  of  the  random 
variable,  given  the  value  in  the  previous  time  periods. 
Theorem  One  justifies' our  use  of  the  best  linear  predictor. 
In  Theorem  Two,  we  derive  the  general  expression  for  this 
predictor,  which  we  later  specialize  to  our  cohort  model. 
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Theorem  Two: 

A  best  linear  predictor  is  defined  to  be  that  function 

E*(Y)  =  a  +  bX,  for  Y  and  X  random  variables, 

2  * 

which  minimizes  E[(Y-E*[Y])  ],  denoted  Var  (Y).   The  expres- 
sions for  E*(Y)  and  VAR*(Y)  are 


E*(Y)  =  E(Y)  +  [Cov(Y,X)/Var(X)] [X-E(X)]  and 

Var*(Y)  =  Var(Y)  (l-p2(X,Y)),  where  the  term  p(X,Y) 
denotes  the  correlation  coefficient  of  X,Y. 

Proof: 

1.  Var*(Y)  =  E[(Y-(a+bX) )2] . 

2.  ^  Var*(Y)    =    -2E [Y-(a+bX) ]    =    -2E[Y]    +    2(a+bE[X]), 
which   when    set    to    zero,    yields    a   =    E[Y]    -   bE[X]. 

3.  g&  Var*(Y)    =    -2E[XY-X(a+bX)]    =    -    2E[XY]    +    2aE [X]  + 2bE [X2], 

which,    when    set    to    0    and   the    substitution    for   a   is    made, 

yields      b   =   E[XYj>   "   E^JEtYl    =    Cov[X,Y]/Var[X]  . 
E[X    ]    -   E    [X] 

4.  Substituting    for   a   and   b    in    the    expression    for   E*(Y) 
and  Var*(Y),    E*(Y)    =    E[Y]    -   bE[X]    +   bX 

=    E[Y]    +    (Cov[X,Y]/Var[X] ) (X-E[X] )    and 

Var*(Y)    =    E[(Y-E*[Y]  )2]  =  b2E[(X-E[X]  ]2)-  2bE  [(X-E  (X))(1-E(Y))) 

=   Var[Y]    +   b2Var[X]    -    2bCov[X,Y] 

=   Var[Y]    +    [Cov(X,Y)/Var(X)]2Var[X] 
-    2(Cov[X,Y]/Var[X] )Cov[X,Y] 

=   Var[Y] [l-Cov2[X,Y]/(Var[X]Var[Y] )] 

=   Var[Y] (l-p2[X,Y] ).     Q 
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In  a  similar  manner,  E**(Y)  =  E(Y)+b1(X-E(X))+b2(Z-E(Z)) 


minimizes  Var** ( Y) =  E [ ( Y-E** ( Y) )2]  when 


b1  =  D[Cov(X,Y)/Var(X)-  Cov( Y,Z ) Cov(X,Z )/{Var (X)Var( Z ) } ] , 
b2  =  D[Cov(Z,Y)/Var(Z)  -  Cov(  Y,X)Cov(X,Z  )/{Var  (X)Var(  Z) }  ]  , 
and 

D   =    l/(l-p2[X,Z]). 

The  expression  for  Var**(Y)  then  becomes 

Var**(Y)  =  Var(Y)  +  b-^VarU)  +  b22Var(Z)+  2b1b2Cov(X, Z ) 
-  2b1Cov(Y,X)  -  2b2Cov(YsZ). 

We  now  have  an  expression  for  the  best  linear  predictor 
for  Y(t)  when  we  know  the  past  realizations,  Y(t-l)  and 
Y(t-2),  regardless  of  the  distribution  of  Y(t).   For  large 
cohort  sizes  at  entry  into  the  system,  we  have  that  this 
predictor  is.  best  (i.e.,  it  is  E  [Y(  t )  |  Y(t-l)  ,  Y(  t-2  )  ]  )  , 
since  the  distribution  of  Y(t)  will  be  very  close  to  normal. 
Hence  the  functions  E*  and  E**  are  actually  conditional 
expectations  given  the  past  period  error  and  the  last  two 
period  errors  respectively. 

The  only  terms  used  in  expressing  E*  and  E**  which  have 
not  been  derived  are  the  Cov  [Y(  t )  ,Y(  t-1)  ]  and  Cov  [Y(  t )  ,Y(t-2  )] 
Since  independence  exists  between  X.(t-y,y)  and  X.(t-y,y) 
for  i  i-  j  ,  and  independence  exists  between  X.(t-s,s)  and 
X. (t-u,u)  for  u^s,  the  covariance  between  all  such  X  terms 
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is  zero  except  for  that  between  X.(t-s,u)  and  X.(t-s,w). 
Here,  for  w>u,  we  may  interpret  X.(t-s,u)  to  be  the  remnants 
of  the  initial  size  at  entry,  X.(t-s);  and  X.(t-s,w)  to  be 
the  remnants,  some  w-u  epochs  later,  of  X.(t-s,u).   Given 
that  a  member  of  X.(t-s)  remains  until  a  time  u  epochs 
later,  the  probability  of  his  remaining  until  w  epochs  after 
entry  into  the  system  is  p.(w)/p.(u).   (Assuming  a  person 
who  leaves  the  system  does  not  return,  0<_[p .  (w)/p .  (u)  ]  <1 
for  w>u.)   The  conditional  expectation  of  X.(t-s,w)  given 
the  realization  of  X.(t-s,u)  is  then  [p . (w)/p . (u) ] X. ( t-s ,u) . 
Hence  the  covariance  of  X.(t-s,u)  and  X. (t-s,w)  is  derived 
by  the  following  argument: 

E[X1(t-s,u),Xi(t-s,w)]  =  [pi(w)/p1(u)]E[X1(t-s,u)2] ; 

Cov[Xi(t-s,u),Xi(t-s,w)]  =  [pi(w)/pi(u)]E[Xi(t-s,u)2] 

-  E[X1(t-s,u)]E[Xi(t-s,w)] , 

where 

E[X.(t-s,u)]  =  p.(u)X.(t-s)  and 

E[XjLCt-s,w)]  =  pi(w)X1(t-s); 
and 

E[X1(t-s,u)]E[Xi(t-s,w)]=  pi(w)pi(u) [Xi(t-s)]2 

=  [p^wVp^u)]  [pi(u)]2[X.(t-s)]2 

=  [p.(w)/pi(u)]{E[X.(t-s)]}2. 
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Hence, 

Cov[X1(t-s,u) ,X1(t-s,w)]  =  [pi(w)/p1(u)]E[Xi(t-s,u)2] 

-[p1(w)/pi(u)]E2[X1(t-sJu)] 
=  [p1(w)/p1(u)]Var[X1(t-s,u)] 
=  [p1(w)/pi(u)]p1(u)q1(u)X1(t-s) 
=   p1(w)q1(u)Xi(t-s). 

To  express  the  covariance  between  X.(t-s,s)  and  the  remain- 
ing number  of  the  cohort  with  age  s-1  (one  period  ago), 
X.  (t-SjS-1)  j  we  substitute  w=s  and  u=s-l  to  obtain 

Cov[X. (t-s,s) ,X. (t-s, s-1)]  =  p. (s)q. (s-l)X. (t-s) ,  for  s>l. 

Similarly,  by  substituting  w  =  s  and  u=  s  -  2, 

Cov[X. (t-s,s) ,Xi(t-s,s-2)]  =  pi(s)qi(s-2)Xi(t-s),  for  sl2. 

To  obtain  Cov[Y. ( t ) ,Y. (t-1) ] ,  the  covariance  existing 
between  all  the  remnants  at  time  t  from  cohorts  which  entered 
grade  i  and  the  remnants  from  the  same  cohorts  at  time  t-1, 
we  sum  on  s  from  1  to  M  to  obtain, 


M 
Cov[Y. (t),Y. (t-1)]  =   £  Cov[X  (t-s,s),X  (t-s, s-1)] 
1      1         s  =  l 

M 
=  I   p  (s)q, (s-l)X. (t-s). 
s=l  x     x      x 
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Similarly,  (mutatis  mutandi) 

M 
Cov[Y. (t),Y. (t-2)]  =   Z  Cov[X, ( t-s ,s ) ,X. ( t-s ,s-2 ) ] 
11         s  =  2      x         ± 

M 
=  I   p  (s)qi(s-2)X.(t-s). 
s  =  2 


The  independence  between  cohorts  starting  in  different 
grades  has  already  been  established;  thus,  to  obtain  the 
covariance  between  all  remnants  at  time  t  with  all  remnants 
at  time  t-1  and  with  all  remnants  at  time  t-2,  we  sum  on  i 
from  1  to  K  the  covariances  of  the  Y.(t)  terms  to  obtain, 
respectively , 

K 
Cov[Y(t),Y(t-l)]  =   Z  Cov[Y. (t),Y. (t-1)] 

i=l  * 

K    M 
=  .  Z    Z  p1(s)qi(s-r)X  (t-s) 
i=l  s=l 


and 


K 
Cov[Y(t),Y(t-2)]  =   Z  Cov[Y, (t),Y. (t-2)] 

1=1      x      x 


K    M 

Z    Z  p  (s)q.(s-2)X  (t-s). 
i=l  s=2 


To  summarize,  the  expressions  for  estimating  the 
remaining  persons  in  the  system  who  started  in  grade  i  are 


E*(Y,(t))  =  E[Y,(t)]  +  b.  .{Y.(t-l)  -  E[Y.(t-l)]}, 
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where 

b     =  Cov[Y,(t),Y,(t-l)]/Var[Y,(t-l)]; 

J-  j"  -L         -L  1 

and 

E**(Y.(t))  =  E[Y.(t)]  +  b}    ,  (Y.(t-l)  -E[Y,(t-l)]} 


for 


+  bijt{Yi(t"2)  "  E[Y.(t-2)]}, 


bi  t  =  d(A!-A2A3^   and  bi  t  =  d(A^-A1A2)' 
where 

d  =  l/{l-p2[Y1(t-l),Y1(t-2)]}} 

A1   =  Cov[Y1(t),Yi(t-l)]/Var[Y1(t-l)]  =  b±    t, 

A2  =  Cov[Yi(t-l),Y1(t-2)]/Var[Y1(t-2)]  =  b±    t__1 

and 

A3  =  Cov[Y1(t),Y1(t-2)]/Var[Y1(t-2)]. 

The  expressions  for  estimating  the  number  of  total 
remaining  persons  are  then 


E*(Y(t))  =  E[Y(t)]+  bt{Y(t-l)  -  E[Y(t-l)]} 


and 


E**(Y(t))  =  E[Y(t)]  +  bJ{Y(t-l)  -  E[Y(t-l)]} 
+  b2{Y(t-2)  -  E[Y(t-2)]}; 
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where 


bt  =  Cov[Y(t),Y(t-l)]/Var[Y(t-l)] 

K  K 

=   I  Cov[Y. (t),Y. (t-1)]/  {  E  Var[Y. (t-1)]}, 
1=1     x     1  i=l     x 


bj  =  D{bt  -  bt_1Cov[Y(t),Y(t-2)]/Var[Y(t-2)]}, 
b2  =  D{Cov[Y(t),Y(t-2)]/Var[Y(t-2)]  -  b^.^}, 


D  =  1/{1  -  p2[Y(t-l),Y(t-2)]}. 


This  model  for  predicting  the  total  number  in  the  system 
at  time  t  is  applied  to  the  problem  of  predicting  total 
student  enrollment  in  Section  V. 


and 
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III.   MEETING  SPECIFIED  GOALS 

Consider  an  organization  in  which  desired  numbers  are 
specified  for  personnel  in  each  grade  i  at  times  t+1, 
t+2,  ...  in  the  future.   It  is  the  objective  of  this 
section  to  extend  the  model  of  Section  II  for  predicting 
the  future  grade  size  within  the  organization. 

Define  p. . (t,u)  to  be  that  fraction  of  personnel  in 
grade  j  at  time  u  given  they  entered  the  system  in  grade  i 
at  time  t,  where  t<u.   If  the  grades  are  numbered  in 
hierarchal  order,  i<j  means  a  promotion,  i>j  means  a 
demotion  and  i=  j  means  that  p..(t,u)  is  the  fraction 
which  remain  in  grade  i  in  the  period  t  to  u.   If  the  rates 
of  movement  between  grades  are  stable  for  the  time  period 
being  considered,  an  assumption  of  stationarity  or 
independence  of  time  t  can  be  made  about  the  transferred 
fractions,  and  p..(t,u)  can  be  expressed  as  p.. (u-t);  that 
is,  the  fraction  transferred  from  grade  i  to  j  is  a  func- 
tion only  of  the  elapsed  time  u-t .   Let  X..(t,u)  denote 
the  number  in  grade  j  at  time  u  of  that  cohort  which 
entered  the  organization  at  time  t  in  grade  I.   We  then 
have  that  X..(t,u)  is  a  binomially  distributed  random 
variable  with  parameters  X. (t)  and  p.. (u-t). 

X  _L  J 

By  representing  the  number  of  persons  in  grade  j  at 
time  t  as  YJ(t)  in  terms  of  the  cohort  portions  remaining 
from  the  initial  size  at  entry  into  grade  i  in  all  previous 
epochs,  we  sum  on  i  and  sum  on  s  to  obtain 
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M    K 
YJ (t)  -   I    EX.. (t-s,t) . 
s=0  i  =  l  « 

It  should  be  noted  that  X. . (t-s,t)  is  independent  of 
X.  . (t-r,t)  for  s  7*r,  since  the  cohort  entering  grade  i 
at  time  t-s  acts  independently  of  the  cohort  entering 
grade  i  at  time  t-r.   Note  also  that  X,  .(t-s,t)  is  inde- 
pendent of  X..(t-s,t)  for  i^k,  since  these  are  the  rem- 
nants  presently  in  grade  j  at  time  t  of  those  who 
entered  the  system  at  time  t-s  in  different  grades.   For 
s  =  0,  X..(t,t)  is  0  when  i^j  and  X..(t,t)  is  X.(t); 
that  is,  a  cohort  which  just  entered  the  system  at  time  t 
in  grade  i  will  have  no  opportunity  to  diminish  in  size 
or  to  be  transferred  to  another  grade  in  the  same  time 
period.   Of  note  is  the  fact  that  independence  does  not 
exist  between  YJ(t)  and  Y  (t)  for  j  ^£,  since  in  each  of 
these  random  variables,  there  may  exist  people  from  the 
same  initial  cohort.   At  this  point,  the  distinction 
between  Y.(t)  and  Y  (t)  should  be  clear.   In  expressing 
Y.(t),  we  are  counting  the  remnants  of  X.(t-s)  in  all 
grades.   In  expressing  Y1(t),  we  are  counting  the  number 
presently  in  grade  i  as  remnants  from  all  previous  time 
periods  of  cohort  entries  into  all  grades. 

We  are  now  able  to  express  the  means  and  variances  of 
Y^(t)  as  the  sum  of  independent  means  and  variances 
respectively  of  X..(t-s,t); 

KM  KM 

E[YJ(t)]  =  E[  I         I    X..(t-s,t)]  =   £    £  p.  .(s)X. (t-s) 

i=l  s=0  1J  i=l  s=0  1J     1 
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and 

K        M 


Var[YJ(t)]    =      Z         Z    p      (s)q    .(s)X    (t-s)    for   j=ls25...K; 
i=l    s=o    1J  1J 


where 


p  .  .  ( 0  )    =    1 ,    and   q . , ( s )    =    1   -   p  .  .  (  s  )  . 

11  -L  J  -'-J 


Denote  by  G.(t+1),  G.(t+2),...  the  desired  goals  for 
the  size  of  grade  i  at  times  t  +  1,  t+2,  ...  respectively. 
Denote  by  Z.(t+1),  Z.(t+2),...  the  numbers  in  grade  i  at. 
times  t+1,  t+2,...  which  were  in  the  system  at  time 
t,  t+1,... j  in  one  of  the  K  grades  of  the  organization. 
The  number  X.(t+1)  is  now  a  controlled  variable;  that  is, 
the  number  to  recruit  into  grade  i  at  time  t+1  in  order  to 
attain  G.(t  +  1)  is  X±(t  +  1)  =  G^t  +  1)  -  Z^t+1). 

We  now  assume  that  no  demotions  take  place:  p. . (s)  is 
zero  for  j<i.   At  the  end  of  each  epoch,  a  member  is 
promoted,  remains  in  present  grade  or  leaves  the  organi- 
zation.  With  these  restrictions  placed  on  the  system,  we 
have  that  the  expected  values  for  Z.(t+1)  and  Z.(t+2)  are 

i        M 
E[Z, (t+1)]    =      Z         Z    X.(t-s)    p       (s) 
1  j=l    s=0    J  J1 

and 

i        M 
E[Z    (t+2)]    =      Z         Z    X    (t-s+l)p.    (s),    for   i    =    1,2, ...K. 
1  j=l    s=0    J  J1 
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The  variance  terms  are 


Var[Z±(t  +  l)] 


i    M 

Z    Z  X.  (t-s)p. . (s)q. . (s) 


j=l  s  =  0 


Ji 


lJi 


and 


i    M 
VartZ. (t+2)]=   Z    Z  X  .  ( t-s  +  l)p . . (s )q . . ( s ) 

j=l  s=0  J         ^     ^ 


With  the  same  definition  as  in  Section  II  for  p.(s)  (that 

is,  the  fraction  of  X.  (t-s)  remaining  in  the  system  at 

1  K 

time  t),  we  have  a  restriction  that   Z  p..(s)  =  p.(s) 

for  all  values  of  s  (i.e.,  s=0,l,...M). 

Suppose  our  problem  is  to  avoid  overmanning  or  under- 
manning  grade  i  (i=l,2,...K)  in  the  periods  t+1,  t+2,  when 
the  recruiting  for  these  periods  must  be  planned  ahead. 
Assume  that  there  are  penalties  defined  when  the  number 
of  personnel  in  each  grade  is  above  or  below  the  goals; 

i.e.,  C.  (t)  when  the  number  is  over  the  desired  values  and 

3      1 

C.~(t)  when  below.   The  objective  function  to  be  minimized 
is 


r  2 


E, 


K 


Z    Z  {C.  (t+x)Max[0,G. (t+x)-X. (t+x)-Z. (t+x)] 
Lx=l  i=l 


-C."(t+x)Min[0,G. (t+x)-X. (t+x)-Z. (t+x)]} 


where  E„(0  denotes  the  expected  value  of  (•)  over  the  joint 
density  of  Z  =  (  Z±(  t  +  1)  ,Z2(  t  +  1)  ,  .  .  .  ZR(t+l)  ,Z1(t+2  ),. .  .  ZK(t+2)), 
a  vector  of  random  variables  with  a  multivariate  normal 
distribution.   The  optimal  values  of  X.(t+1)  and  X.(t+2) 
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+ 


would  minimize  this  objective  function.   When  C.  =C.  =C, 

ill* 
2    K 
the  expression  reduces  to   Z    £  C.(t+x) 

x=l  i=l  ! 


-E[Z1(t+x)]} 


{G. (t  +  x)  -  X. (t  +  x) 

1         1 


,  where  the  expected  values  of  Z.(t+1)  and 


Z.(t+2)  are  given  above. 

We  now  assume  that  the  realizations,  Y.(t)  i=l,2,...K 
are  known  and  proceed  to  determine  the  best  predictors  for 
Z.(t+1)  and  Z. (t+2).   Assuming  the  normality  property  for 
Z.  and  Y. 3  this  best  predictor  will  be  the  expectation  of 


', .    conditioned  on  the  values  of  Y.,  j=l,2,...i,  for  t 


he 


present  time;  defining  E'[Z.(t+l)]  as  the  best  predictor 
of  Z.(t  +  1)  and  E,T[Z.(t+2)]  as  the  best  predictor  of  Z .  (t+2) 
when  the  values  of  Y.(t)  are  known,  j=l,2,...K,  we  have 

J 

E'[Z.(t+l)]  =  E[Z  (t+1)  |  Y  (t),  j=l,2,...i] 
and 

E"[Z  (t  +  2)]  =  E[Z  (t  +  2)  |  Y  (t),  j=l,2,...i] 

where  the  realizations  of  Y.(t)  for  j>i  have  no  effect  on 

J 

E'  and  E''  and  are  therefore  not  considered  in  the  expres- 
sions.  (This  follows  from  the  fact  that  p..(s)  is  zero 
for  i<j  .  ) 

With  the  independence  of  Y.(t)  and  Y.(t)  for  i^j,  we 
have  that  Cov[Y1  ( t )  ,Y  .  ( t )  ]  =  0  for  i^j.   Since  X±(t+1)  is 
a  control  variable,  we  also  have  that  Var  [YJ  ( t+x)]  =  Var  [Z  .(t+x)] 
and  CovfYJ (t+x) ,Y. (t)]  =  Cov [Z. ( t+x) ,Y . ( t ) ]  for  x=l,2  and 
i=l,2,...K.   Extending  Theorem  Two  of  Section  II  to  E'  and 
E''  we  thus  have  that 
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E'(Z.(t  +  l))  =  E[Z.(t  +  l)]  +   E  d^{Y.(t)  -  E[Y.(t)]}, 

i  i  j=1  z         3  J 

and 

i 
E"(Z.(t  +  2))  =  E[Z,(t+2)]  +  I    giX{Y.(t)  -  E[Y,(t)]}, 

where 

d^1  =  Cov[Y1(t+l),Y.(t)]/Var[Y.(t)] 
and 

g^1  =  Cov[Y1(t+2),Y,(t)]/Var[Y.(t)] 

for  i  =  1,2,...K.   The  terms  E[Z.(t+x)],  x=l,2,  and 
Var[Yj(t)]  have  been  derived  previously,  leaving  the  terms 
Cov(Yi(t+x),Y. (t)) ,  x=l,2,  to  be  derived: 

J 

Given  that  a  member  who  entered  the  organization  at 
time  t-s  in  grade  j  was  in  the  organization  at  time  t, 
(i.e.  a  member  of  X.(t-s,s)),  the  probability  that  this 
member  will  be  in  grade  i  at  time  t+x,  is  p . . ( s+x)/p . (s ) . 
The  conditional  expectation  E [X . . ( t-s  ,s+x) I X  .  ( t-s  ,s ) ] 
(that  is  the  expectation  of  the  number  in  grade  i  at  time 
t+x  of  those  who  started  in  grade  j  at  time  t-s,  conditioned 
on  the  number  remaining  in  the  organization  at  time  t  who 
started  at  time  t-s  in  grade  j),  is  p . . ( s+x)X . ( t-s ,s )/p . ( s ) . 
Using  a  similar  argument  as  that  in  deriving  Cov  [Y(  t )  ,Y(t-l)  ] 
in  Section  II,  we  have  that 
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pn.  .  (s  +  x) 


E[Xj.(t-s,s+x)JXj.(t-s3s)]  =  ^(s) E[(Xj(t-s,s))^] 


J 
hence 


p  .  . (s  +  x) 
Cov[X   (t-s, s+x), X. (t-s, s)]  =  i    fts, Var[(X.(t-s,s)] 


=  p  .  . (s+x)q . (s)  X. (t-s) . 

Since  independence  exists  between  X.(t-s.s)  and  X.(t-r.r) 

and  between  X.. (t-s. s+x)  and  X..(t-r,r+x)  for  s  7*  r ,  we  have 

M-x 
Cov[Y1(t+x) ,Y.(t)]  =   Z  Cov[X. . (t-s, s+x) ,X.(t-s)] 
J        s=0     J1  J 

M-x 
=   I  p   (s+x)q  (s)X  (t-s), 
s=0  J        J     J 

for  x  =  1,2;   j=l,2,...i. 


The  use  of  E'  and  E''  in  place  of  E(Z.(t+l))  and 

E(Z.(t+2))  respectively  (using  the  present  values  of 

Y.(t)),  is  relevant  in  the  objective  function  when 

C.(t+x)  =  C.+(t+x)  =  C.~(t+x),  x=l,2.   The  variances 
1         1  l 

(using  the  fact  that  Cov( Y . ( t ) ,Y . ( t ) )  is  zero  for  i  ^j), 
are 


i     .  .  ? 

Var'  [Z, (t  +  1)]  =  Var[Z, (t  +  1)]  +   Z  { ( d^ 1 rVar [Y, ( t ) ] 

i  i         j=1    z  J 


-  2d^1Cov[Y1(t+l) ,Y.(t)] } 
z  J 
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and 

i     •  •  ? 
Var"  [Z.  (t  +  2)]  =  Var  [Z  .  (t  +  2)  ]  +  I    {  (g^1 )  Var[Y  .  ( t )  ] 

i  i         j=1   «        J 

-  2gJiCov[Yi(t+2),Yj(t)]>. 

An  application  of  such  a  model  might  be  that  of  a  uni- 
versity system  with  the  grades  defined  for  curriculum  and 
level  (upper  and  lower,  for  instance)  in  which  the  goals 
G.(t+1)  and  G.(t+2)  were  specified  to  fully  utilize  the 
facilities  without  inflicting  a  lack  of  classrooms  by 
over-enroUing.   The  costs,  C.(t+1)  and  C.(t+2)  might  be 
based  on  the  losses  encurred  financially  by  overstaffing 
for  a  below-desired-level  of  class  size  and  the  losses 
encurred  by  the  added  administrative  burdens  of  rejecting 
enrolled  students  when  overages  occur. 
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,  IV.   SENSITIVITY  TO  ERRORS  IN  PROBABILITY  ESTIMATION 

The  fraction  of  remaining  individuals  who  start  in  a 
grade  and  remain  for  s  epochs,  s=l,2,...M  is  used  as  an 
estimate  of  the  probability  of  this  event.   In  the  paper  by 
McAfee  (1970),  a  statistical  test  is  made  on  a  sample  of 
size  three,  to  test  the  hypothesis  that  p. (s)  for  each 
cohort  is  from  the  same  population.   This  is  to  say  that 
for  moderate  sample  sizes  in  estimating  p.(s),  we  have  a 
sample  mean  to  use,  which  for  large  sample  sizes,  approaches 
the  true  value  of  p.(s).   At  best,  we  use  the  estimate  and 
for  this  reason  examine  the  sensitivity  of  the  model 
described  in  Section  II  to  error  which  may  exist  between 
our  estimate  and  the  true  value  of  p.(s);  viz.,  the  sensi- 
tivity of  the  expected  values,  E[Y(t)],  E*[Y(t)]  and 
E**[Y(t)]  to  errors  in  p.(s).   We  shall  consider  two  cases: 
one,  in  which  an  error  Ap . ( s )  exists  between  our  estimate 
and  the  true  value  of  p.(s)  for  some  s  and  i;  the  second, 
in  which  an  error  exists  for  all  s  and  i. 

Taking  the  partials  of  E[Y(t)],  E*[Y(t)]  and  E**[Y(t)] 

with  respect  to  p.(s)  yields  the  following  expressions 

(where  -3 — /    \    denotes  the  partial  differential  of  ( • )  with 
dp.(s; 

respect  to  p^(s)): 

dE[Y(t)]/dpi(s)  =  d/dp1(s)  [Xj_(  t-s  )p±  (s  )  ]  =  X±(t-s); 
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dE*[Y(t)]/dPi(s)  =  d/dp.(s)  E[Y(t)] 


+b  d/dp.(s)  (Y(t-l)  -  E[Y(t-l]} 
+  {Y(t-l)  -  E[Y(t-l)]  }d/dp1(s)(bt) 
=  Xi(t-s)  -  btXi(t-s-l)  +  Ry*; 
and 

dE**[Y(t)]/dp.(s)  =  d/dp.(s)E[Y(t)] 

+  bJd/dp±(s){Y(t-l)  -  E[Y(t-l)]} 
+  b^d/dPi(s){Y(t-2)  -  E[Y(t-2)]} 

+{Y(t-l)  -  E[Y(t-l)]}d/dpi(s)(bJ) 
+{Y(t-2)  -  E[Y(t-2)]}d/dp1(s)(b^) 
=  Xi(t-s)  -  bjxi(t-s-l)  -b^X1(t-s-2)+R»*; 

where  R*  and  R**  represent  sums  of  remainder  terms  negligible 
in  comparison  to  other  terms  in  the  expressions  respectively 
for  dE*/dp.(s)  and  dE**/dp.(s).   (See  the  appendix  for  the 
exact  expressions  represented  by  R. ) 

Neglecting  R  for  small  errors  in  the  estimate  for  p.(s), 
the  changes  in  predicted  values  are  then 


AE[Y(t)  ]  =  Xi(t-s)  APjL(s), 

AE*[Y(t)]  =  {Xi(t-s)  -  btXi(t-s-l)}  Ap1(s), 
and 

AE**(Y(t))  =  {X.(t-s)  -  b^X.(t-s-l)  -  b^X. ( t-s-2  )  }  Ap  .  (s). 
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It  is  observed  in  the  expression  for   E*(Y(t)),  that  as 
b,  approaches  one,  the  error  in  the  predicted  value  is 
Api(s)AXi,  where  AX. =X . ( t-s )-X. ( t-s-1) .   Since  the  covari- 
ance  of  Y(t)  and  Y(t-l)  is  always  positive,  b,  takes  on  a 
value  between  zero  and  one,  and  b,  acts  as  a  dampening 
coefficient  for  errors  in  the  predicted  value  of  Y(t)  using 
E*. 

Similarly  for  the  expression  of  AE**(Y(t)),  as 

1   2 
(b,+b,)-*-l,  the  error  in  the  predicted  value  is 

Ap1(s)(b^AxJ  +  b^AX^),  where  the  terms  AX?"  =  X.  (t-s  )-X.  (t-s-1) 

and  AX?  =  X.(t-s)  -  X.(t-s-2).   (Viz.,  the  differences 

between  the  cohort  entering  grade  i  at  time  t-s  and  those 

which  enter  the  periods  before.)   The  dampening  effect  of 

the  error  in  E**[Y(t)]  is  evident  with  the  b,  coefficients 

(i=l,2).   Thus,  a  smaller  error  in  the  predicted  value  of 

Y(t)  results  when  E*  and  E**  are  used  rather  than  E. 

Let  us  assume  now  that  p.(s)  =  (p.)   for  all  i;  i.e., 

we  have  a  geometric  distribution  of  remaining  members  of 

each  grade.   If  X.(t)  were  a  constant  for  all  t  (that  is, 

X. (t)  =  X. ) ,  then 


K  K 

E[Y(t)]  =   I  E[Y, (t)]  =  I 
1=1    x       i=l 


S  X.pi- 

s  =  0 


K 

Z  X  /(1-p  ); 
i  =  l  1      1 


K 

E*(Y(t))  =   Z  X,/(l-p.)  +  b, {Y(t-l)-X./(l-p, )} 
i=1  l      l      Z  11 


for 


bt  =  b  = 


K  " 

Z  p.  X./d-p.  ) 
Li  =  l 


/ 


'  K  2" 

E  p  X  /(1-p/) 

Li=l  x  x  . 
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and 


K 


E**(Y(t))  =  t    X1/(l-p.)  +  bJ[Y(t-l)  -  Xi/(l-pi)] 

i=l 


for 


+  b£[Y(t-2)  -  X±/(l-p1)], 


K  K 

hi   =  [b/(l-b)][l  -   Z  p.3X./  I   p.X.)]  =  b1 
r  1=1  X   X  1=1  X  X 


and 


fr 


bj  =  [l/(l-b)] 


K 


3- 


K 


E  p. JX./(  Z  p.X. 
.  -,  1  1  .  T  1  1 
1=1       1=1 


-  (b)2>  =  b2 


The  resultant  terms  for  the  changes  in  prediction  are  then 

K 
AE(Y(t))  =  I    Ap.X  /(1-p /)  , 
1  =  1^      X 

K 
AE»(Y(t))  =   E  Ap±X  (l-b)/l-p^)  =  (l-b)AE(Y(t)), 
i  =  l 


and 


AE**(Y(t))  =   I  Ap.X.  (l-b1-b2)/(l-p.2)  =  ( l-b1-^)  AE(Y(  t )). 
i=l   x  1  x 


The  dampening  factors  for  AE*(Y(t))  and  AE**(Y(t))  are 
immediately  evident. 

Using  E*  or  E**  in  place  of  E,  we  can  reduce  the  predic- 
tion error  caused  by  errors  in  p.(s). 
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V.   APPLICATION  TO  UNIVERSITY  ENROLLMENT 

The  problem  of  predicting  total  student  attendance  at 
the  University  of  California,  Berkeley,  is  approached  using 
the  model  for  the  best  predictors,  E*(Y(t))  and  E**(Y(t)). 
Data  was  obtained  from  Berkeley  for  new  student  enrollments 
and  is  tabulated  in  Table  I. 

Total  Student  predictions  are  required  for  only  the 

fall  quarter  of  each  year.   To  estimate  the  p.  (s)  (the 

th 
probability  of  a  student  remaining  to  the  s    subsequent 

fall  period  following  entry  into  grade  i,  quarter  j),  data 
was  collected  on  the  numbers  of  students  from  the  cohorts 
entering  in  the  Pall  of  1966  and  the  Winter,  Spring  and 
Summer  of  1967  which  were  still  in  attendance  each  succeed- 
ing fall  term.   The  most  recent  data  available  was  from 
1969,  which  included  at  most  3  years  for  any  cohort.   To 
estimate  p.  (s)  for  the  years  in  attendance  4-6,  we 
assumed  students'  attendance  behavior  over  time  is  essen- 
tially stationary  and  used  past  cohort  data  analysis  found 
in  Suslow  et  al  (1968).   Our  estimates  of  p.  (s)  are  given 
in  Table  II,   It  should  be  pointed  out  that  during  the 
years  1961  through  1966,  the  University  followed  a  semester 
system.   Starting  in  1967*  the  University  switched  to  a 
quarter  system. 

The  parameters  calculated  with  the  data  in  Tables  I  and 
II  are  shown  in  Table  III.   Our  estimates  of  total  student 
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TABLE  III 
VALUES  USED  IN  PREDICTION 


CASE  A 
(Summer  enrollees  treated  as  Fall  enrollees) 


Year 
1966     1967     1968     1969 


E(Y(t))  16919.5   18339.5   18365.5  18133.7 

Var(Y(t))^  67.66    67.86     71.02  70.83 

Cov(Y(t) ,Y(t-l) )  2261.1    2186.0  23^9.6 

Cov(Y(t),Y(t-2))  984.5  970.6 

p(Y(t),Y(t-l)  .49245    .45358  .46707 

bt  .49400   .47471  .46586 

b|  .48641  .46055 

b2  -.02561  -.01305 

Y(t)-E(Y(t))  -172.5    -2.518   -374.49  -17.742 

Var*(Y(t))^  59.06     63.29  62,63 

Var**(Y(t))1"2  63.49  62.63 


CASE  B 
(Summer  enrollees  treated  separately) 

Year 


1966     1967     1968     1969 


E(Y(t)) 
Var(Y(t)  p 
Cov(Y(t) ,Y(t-l)). 
Cov(Y(t) ,Y(t-2)  ) 
p(Y(t) ,Y(t-l)) 

bt 

>S 

Y(t)-E(Y(t)) 

Var*(Y(t))lg 
Var**(Y(t))^ 


16919.5 

18186.9 

17848.4 

17329.6 

67.66 

68.73 

73.04 

73.60 

2177.6 

2344.6 

2398.7 

971.9 

1077.6 

.46828 

.46704 

.44559 

.47577 

.49637 

.44955 

.51040 

.44672 

.03070 

.00637 

-172.5 

150.12 

142.61 

786.42 

60.75 

64.59 

65.89 

64.74 

65.78 
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enrollment  In  the  falls  of  1967  through  1969  are  given  in 
Table  IV  with  actual  enrollment  figures  for  comparison. 
Due  to  a  quota  limit  for  fall  enrollees  and  the  start  of 
year  round  operations  in  the  Summer  of  1968,  it  was  felt 
that  summer  enrollees  in  1968  and  1969  might  not  behave  as 
summer  enrollees  in  1967*  but  in  fact  be  early  fall  appli- 
cants who  enrolled  in  the  summer  rather  than  risk  unsuc- 
cessful enrollment  in  fall.   This  fact  plus  the  relatively 
small  sample  size  from  1967  for  estimating  the  probabilities 
for  summer  enrollees  to  remain  in  the  system,  leads  to  two 
cases  for  estimation:   Case  A,  in  which  summer  enrollees 
are  treated  as  fall  enrollees;  and  Case  Bs  in  which  summer 
enrollees  are  considered  separately  from  the  fall  enrollees. 
The  two  cases  are  tabulated  in  Tables  III  and  IV. 
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TABLE  IV 

PREDICTED  TOTAL  PALL  ENROLLMENT 
(With  plus  or  minus  two  standard  deviations) 

CASE  A 
(Summer  Enrollees  treated  as  Fall  Enrollees) 


Estimate  using  E 
Estimate  using  E* 
Estimate  using  E** 


1967 


Year 
1968 


1969 


18339±135   18365±1^2   18133±1^1 

18256±118   18364±126   17959±125 

18368±126   1796l±125 


CASE  B 
(Summer  Enrollees  treated  separately) 


Estimate  using  E 
Estimate  using  E* 
Estimate  using  E** 


1967 


Year 
1968 


1969 


18186±137   17848±146   17329±1^7 

1810^±122   17992±129   17393±131 

17919±129   1739^±131 


Actual  Total  Enrollment 


18337 


17991 


18116 
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APPENDIX  A 

REMAINDER  TERMS  TO  PREDICTION  ERROR 

In  Section  IV,  expressions  for  dE*/dp.(s)  and  dE**/dp.(s) 
are  given  in  which  small  remainder  terms  are  represented  by 
R*  and  R**  respectively.   The  term  R*  represents 
{Y(t-l)  -  E[Y(t-l)] )db, /dp. (s)  where 

{X.(t>-s-I)q,(s-l)-X,  (t-s)p.  (s+l)-b,  X.  ( t-s-1)  [l-2p.  (s)]} 

dbVdp,(s)=  -i i ± i £-i ± . 

z       z  Var[Y(t-l)] 


The  term  R**  represents  the  sum, 

{Y(t-l)-E[Y(t-l)]}dbJ/dp1(s)  +  { ( Y( t-2 )-E [Y(t-2 ) ] }db2/dPi ( s ) . 
Msl/x     i     \  a2  Jv,   m       rW+-     iM     k  Cov[Y(t)  ,Y(t-2)]Vr,      ns 

+  Var[Y(t-l)]  (A1~A2~A3~A^+A5+A6) 3 

?  a2 

db^/dPl(s)  =^iF^32yr{Cov[Y(t)aY(t"2)]"btbt-iVar[Y(t>1)3}(DrD2) 

+  Var[Y(t-2)]    t^-Cg-Cg-C^) , 
where 

Al   =   Var[Y(t-D]     [X.(t-s-l)q.(s-l)    -   X.  (t-s  )p  .  (s  +  1)  ] 


A2    =    btXi(t-s-l)  [l-2p1(s)] 


3    =    bt_1[X.(t-s-2)q1(s-2)    -    X±  ( t-s  )Pj_  (s  +  2  )  ] 
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A4    =   £2JL^ffl^^^(t-s-2)qi(s-l)-X1(t-s-l)p1(s  +  l)} 

_    Cov[Y(t);Y(t-2)]      Var[Y(t)j  f+         ?u  ,    ., 

A5 Var[Y(t-2)] ViFTYTt^2TT  bt-lXi(t"s"2)  [1"2pi(s)  ] 

A           Cov[Y(t)  ,Y(t-2)]    .  v    ,.       Wn    0       ,    v, 

A6   =   Var[Y?t-2)]         VA  ( t-s  )  [l-2p  .  (s  )  ] 

D]_    =    Xi(t-s-2){2qi(s-l)    +   bt_1  [2q±  (  s-1)  ]  } 


D2    =    X1(t-s-l) 


_2Pi(S+i)  ♦  bt_l{2q.(s-i)}  Zllntl])] 


C,    = 


1    "  Var[Y(t-2)l    U.  ( t-s-2  )q  .  (  s-2  )    -   X. ( t-s )p. (s+2 ) } 

=  Cov[Y(t)    Y(t-2)]         (t_s_2)fl_2D    (s)1 
L2  Var[Y(t-2)]         Ailt    s    ^m    ^i113^ 

C3   =  bt[X1(t-s-2){qi(s-l)    -   bt_x +  2Pi(s)>  -Xi(t-s-l)pi(s+l)] 


!j,    =   bt_1[Xi(t-s-l){q1(s-l)  -bt  +  2p±(s)}-  Xj_  ( t-s  )Pj_  (  s  +  1 )  ] 


and 


A    =    l/(l-pMY(t-l),Y(t-2)]) 
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